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Abstract. We propose a simple stochastic volatility model which is analytically tractable, 
very easy to simulate and which captures some relevant stylized facts of financial assets, 
including scaling properties. In particular, the model displays a crossover in the log-return 
distribution from power-law tails (small time) to a Gaussian behavior (large time), slow 
decay in the volatility autocorrelation and multiscaling of moments. Despite its few param- 
eters, the model is able to fit several key features of the time series of financial indexes, 
such as the Dow Jones Industrial Average, with a remarkable accuracy. 

;h 

1. Introduction 



1.1. Modeling financial assets. Recent developments in stochastic modelling of time se- 
ries have been strongly influenced by the analysis of financial assets, such as exchange rates, 
stocks, and market indexes. The basic model, that has given rise to the celebrated Black 
& Scholes formula [22l [23], assumes that the logarithm Xt of the price of the asset, after 
subtracting the trend, evolves through the simple equation 



00 

H 

^ dXt = adBt, (1.1) 

qh where a (the volatility) is a constant and {Bt)t>o is a standard Brownian motion. It has 

been well-know for a long time that, despite its success, this model is not consistent with a 
number of stylized facts that are empirically detected in many real time series, e.g.: 

1^ • the volatility is not constant and may exhibit high peaks, that may be interpreted as 

l/^ shocks in the market; 

• the empirical distribution of the increments Xt-\-h — Xt of the logarithm of the price 
called log-returns — is non Gaussian, displaying power-law tails (see Figure 



below), especially for small values of the time span h, while a Gaussian shape is 
approximately recovered for large values of h; 

• • • log-returns corresponding to disjoint time-interval are uncorrelated, but not indepen- 

. ^ dent: in fact, the correlation between the absolute values \Xt+h — Xt\ and \Xs+h — Xs\ 

^ — called volatility autocorrelation — is positive {clustering of volatility) and has a 

^ slow decay in |t — s\ (long memory), at least up to moderate values for \t — s\ (cf. 

Figure |^Kb)[](c)| below) . 



In order to account for these facts, a very popular choice in the literature of mathematical 



finance and financial economics has been to upgrade the basic model (1.1), allowing a = at 



to vary with t and to be itself a stochastic process. This produces a wide class of processes, 
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(a) Diffusive scaling of log-returns. 




(b) Multiscaling of moments. 



Figure 1. Scaling properties of the DJIA time series (opening prices 1935-2009). 



(a) The empirical densities of the log-returns over 1, 2, 5, 10, 25 days show a re- 



markable overlap under diffusive scaling. 



(b) The scaling exponent A{q) as a function of q, defined by the relation mq{h) « 
TP^'i^ (cf. (1.4)), bends down from the Gaussian behavior q/2 (red line) for q > 



g ~ 3. The quantity A{q) is evaluated empirically through a linear interpolation of 
{\ogmq{h)) versus (logh) for /i e {1, . . . , 5} (cf. section [t] for more details). 

known as stochastic volatility models, determined by the process {o't)t>o, which are able 
to capture (at least some of) the above-mentioned stylized facts, cf. [U [29] and references 
therein. 

More recently, other stylized facts have been pointed out concerning the scaling properties 
of the empirical distribution of the log-returns. Given a daily time series (sj)i<i<T over a 
period of T ^ 1 days, denote by ph the empirical distribution of the (detrended) log-returns 
corresponding to an interval of h days: 

T-h 



1 



T-h 



Xi 



log(si) - di 



(1.2) 



i=l 



where di is the local rate of linear growth of log(sj) (see section [t] for details) and 5x{-) 
denotes the Dirac measure at x G M. The statistical analysis of various financial series, such 
as the Dow Jones Industrial Average (DJIA) or the Nikkei 225, shows that, for small values 
of h, Ph obeys approximately a diffusive scaling relation (cf. Figure I^aJ): 

Ph{dr) ~ -^s-^-^^dr, (1.3) 

where g\s a, probability density with power-law tails. Considering the g-th empirical moment 
mq{h), defined by 

T-h 



mg{h) :-- 



1 



T-h 



i=l 



'ph{dr) 



(1.4) 



from relation (1.3) it is natural to guess that mq{h) should scale as /i"^/^. This is indeed what 
one observes for moments of small order q < q (with g ~ 3 for the DJIA). However, for 
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moments of higher order q > q, the different scahng relation h^^'^\ with A{q) < q/2, takes 



place, cf. Figure ]]Kb) This is the so-called multiscaling of moments, cf. [5 ^ \TI\ HUj [T7]. 

An interesting class of models that are able to reproduce the multiscaling of moments — 
as well as many other features, notably the persistence of volatility on different time scales — 
are the so-called multifractal models, like the MMAR (Multifractal Model of Asset Returns, 
cf. [13]) and the MSM (Markov-Switching Multifractal, cf. [13j). These models describe 
the evolution of the detrended log-price {Xt)t>o as a random time-change of Brownian 
motion: Xt = W7(t), where the time-change {I{t))t>o is a continuous and increasing process, 
sometimes called trading time, which displays multifractal features and is usually taken to 
be independent of the Brownian motion {Wt)t>o (cf- jl2j for more details). 

Modeling financial series through a random time-change of Brownian motion is a classi- 
cal topic, dating back to Clark [15], and reflects the natural idea that external information 
influences the speed at which exchanges take place in a market. It should be stressed that, 
under the mild regularity assumption that the time-change (/(t))t>o has absolutely contin- 
uous paths a.s., any random time-change of Brownian motion Xt = W7(f) can be written as 
a stochastic volatility model dXt = atdBt, and viceversaj^ However, a key feature of mul- 
tifractal models is precisely that their trading time {I{t))t>o has non absolutely continuous 
paths a.s., hence they cannot be expressed as stochastic volatility models. This makes their 
analysis harder, as the standard tools available for Ito diffusions cannot be applied. 

The purpose of this paper is to define a simple stochastic volatility model — or, equiv- 
alently, an independent random time change of Brownian motion, where the time-change 
process has absolutely continuous paths — which agrees with all the above mentioned styl- 
ized facts, displaying in particular a crossover in the log- return distribution from a power-law 
to a Gaussian behavior, slow decay in the volatility autocorrelation, diffusive scaling and 
multiscaling of moments. In its most basic version, the model contains only three real pa- 
rameters and is defined as a simple, deterministic function of a Brownian motion and an 
independent Poisson process. This makes the process analytically tractable and very easy 
to simulate. Despite its few degrees of freedom, the model is able to fit remarkably well 
several key features of the time series of the main financial indexes, such as DJIA, S&P 500, 
FTSE 100, Nikkei 225. In this paper we present a detailed numerical analysis on the DJIA. 

Let us mention that there are subtler stylized facts that are not properly accounted by our 
model, such as the multi-scale intermittency of the volatility profile, the possible skewness 
of the log-return distribution and the so-called leverage effect (negative correlation between 
log-returns and future volatilities), cf. [16j. As we discuss in section [sj such features — 
that are relevant in the analysis of particular assets — can be incorporated in our model 
in a natural way. Generalizations in this sense are currently under investigation, as are the 
performances of our model in financial problems, like the pricing of options (cf. A. Andreloli's 
Ph.D. Thesis [2]). In this article we stick for simplicity to the most basic formulation. 

Finally, although we work in the framework of stochastic volatility models, we point out 
that an important alternative class of models in discrete time, widely used in the econometric 
literature, is given by autoregressive processes such as ARCH, GARCH, FIGARCH and their 

^More precisely, "independent random time changes of Brownian motion with absolutely continuous time- 
change" — that is, processes {Xt)t>o such that Xt — Xq = Wi(t), where {Wt)t>o is a Brownian motion and 
(/(t))t>o is an independent process with increasing and absolutely continuous paths a.s. — and "stochastic 
volatility models with independent volatility" — that is, processes {Xt)t>o such that dXt = atdBt, where 
{Bt)t>o is a Brownian motion and {a-t)t>o is an independent process with paths in Lfo;.(R) a.s. — are the same 
class of processes, cf. [6l[29]. The link between the two representations dXt — atdBt and Xt ~ Xo — W/(t) 
is given by at = /P^ and Bt = J^{I' is))-'^'dWj^,^ = J^' {I' (J-' iv)))-'^^dW^. 
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generalizations, cf. [THl El E] • More recently, continuous-time versions have been studied as 
well, cf. [MIES]- With no aim for completeness, let us mention that GARCH and FIGARCH 
do not display multiscaling of moments, cf. [HI §8.1.4]. We have also tested the model 
recently proposed in |1U| . which is extremely accurate to fit the statistics of the empirical 
volatility, and it does exhibit multiscaling of moments. The price to pay is, however, that 
the model requires the calibration of more than 30 parameters. 

We conclude noting that long memory effects in autoregressive models are obtained 
through a suitable dependence on the past in the equation for the volatility, while large 
price variations are usually controlled by specific features of the driving noise. In our model, 
we propose a single mechanism, modeling the reaction of the market to shocks, which is the 
source of all the mentioned stylized facts. 

1.2. Content of the paper. The paper is organized as follows. 

• In section [2] we give the definition of our model, we state its main properties and we 
discuss its ability to fit the DJIA time series in the period 1935-2009. 

• In section [3] we discuss some key features and limitations of our model, point out 
possible generalizations, and compare it with other models. 

• Sections |4j [5] and [6] contain the proofs of the main results, plus some additional mate- 
rial. 

• In section [7] we discuss more in detail the numerical comparison between our model 
and the DJIA time series. 

• Finally, appendix \K\ contains the proof of some technical results, while appendix [B] is 
devoted to a brief discussion of the model introduced by F. Baldovin and A. Stella 
in [3 [31], which has partially inspired the construction of our model. 

1.3. Notation. Throughout the paper, the indexes s,t,u,x, A run over real numbers while 
i, k, m, n run over integers, so that t > means t G [0, oo) while n > means n £ {0, 1,2,.. .}. 
The symbol "~" denotes asymptotic equivalence for positive sequences (a„ ~ 6„ if and only 
if ttn/bn — )• 1 as n — )• oo) and also equality in law for random variables, like Wi ~ AA(0, 1). 
Given two real functions f{x) and g{x), we write / = 0{g) as x — )• if there exists M > 
such that |/(x)| < M|g(j;)| for x in a neighborhood of xq, while we write / = o{g) if 
f{x)/g{x) — )• as — )• xq; in particular, 0(1) (resp. o(l)) is a bounded (resp. a vanishing) 
quantity. The standard exponential and Poisson laws are denoted by Exp{\) and Po(A), 
for A > 0: X ~ Exp{X) means that P{X < x) = (1 - e-^^)l[o,oo)(a;) for all x G M while 
Y ~ Po{X) means that P(Y = n) = e~'*'A"/n! for all n G {0, 1,2,.. .}. We sometimes write 
(const.) to denote a positive constant, whose value may change from place to place. 



2. The model and the main results 

We introduce our model as an independent random time change of a Brownian motion, 
in the spirit e.g. of |15j and |3]. An alternative and equivalent definition, as a stochastic 



volatility model, is illustrated in section 2.2 



2.1. Definition of the model. In its basic version, our model contains only three real 
parameters: 

• A G (0, -|-oo) is the inverse of the average waiting time between "shocks" in the market; 

• D G (0,1/2] determines the sub-linear time change t i— )• t"^^, which expresses the 
"trading time" after shocks; 
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• cr E (0, +00) is proportional to the average volatility. 

In order to have more flexibility, we actually let o" be a random parameter, i.e., a positive 
random variable, whose distribution v becomes the relevant parameter: 

• z/ is a probability on (0,oo), connected to the volatility distribution. 

Remark 1. When the model is calibrated to the main financial indexes (DJIA, S&P 500, 
FTSE 100, Nikkei 225), the best fitting turns out to be obtained for a nearly constant a. 
In any case, we stess that the main properties of the model are only marginally dependent 
on the law v of a: in particular, the first two moments of v, i.e. E{a) and E{a'^), are 
enough to determine the features of our model that are relevant for real- world times series. 



cf. Remark 10 below. Therefore, roughly speaking, we could say that in the general case of 
random a our model has four "effective" real parameters. 

Beyond the parameters A, -D, z^, we need the following three sources of alea: 

• a standard Brownian motion W = (Wt)t>o; 

• a Poisson point process T = {Tn)n& on R with intensity A; 

• a sequence S = (cr„)n>o of independent and identically distributed positive random 
variables with law v (so that an ^ for all n); and for conciseness we denote by o" a 
variable with the same law i^. 

We assume that W, T, S are defined on some probability space P) and that they are 

independent. By convention, we label the points of T so that tq < < n. We will actually 
need only the points (t„)„>o, and we recall that the random variables (— tq), n, {Tn+i—Tn)n>i 
are independent and identically distributed with marginal laws Exp{X). In particular, 1/A is 
the mean distance between the points in T, except for tq and ri, whose average distance is 
2/A. Although some of our results would hold for more general distributions of T, we stick 
for simplicity to the (rather natural) choice of a Poisson process. 
For t > 0, we define 

i{t) := sup{n > : T„ < t} = #{T H (0, t]} , (2.1) 

so that Tj(() is the location of the last point in T before t. Plainly, i{t) ~ Po{Xt). Then we 
introduce the basic process I = {It)t>o defined by 

I, = m := 4.(t-T,(,))'^ + ^^ti(^fc-T.-i)'^ - ali-Tof'', (2.2) 



with the agreement that the sum in the right hand side is zero if i{t) = 0. More explicitly. 
{It)t>o is a continuous process with /q = and = (cj^) h?^ for < /i < (r„+i— 

We note that the derivative {jiIt)t>o is a stationary regenerative process, cf. [3]. See Figure 
for a sample trajectory of {It)t>o when D < ^. 
We then define our model X = {Xt)t>o by setting 

Xt := Wi, . (2.3) 

In words: our model is an random time change of the Brownian motion {Wt)t>o through 
the time-change process {It)t>o- Note that / is a strictly increasing process with absolutely 
continuous paths, and it is independent of W. 

When D = 2 and a is constant, we have It = a'^t and the model reduces to Black & 
Scholes with volatility a. On the other hand, when D < |, the paths of / are singular (non 
differentiable) at the points in T, cf. Figure[2| This suggests a possible financial interpretation 
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Figure 2. A sample trajectory of the process {It)t>o 



of the instants in T as the epochs at which big shocks arrive in the market, making the 
volatihty jump to infinity. This will be more apparent in next subsection, where we give a 
stochastic volatility formulation of the model. We point out that the singularity is produced 
by the sub-linear time change 1 1— )• t"^^, that was first suggested by F. Baldovin and A. Stella 
in [71 EI] (their model is described in appendix |B]) . 

2.2. Basic properties. Let us state some basic properties of our model, that will be proved 
in section El 

(A) The process X has stationary increments. 

(B) The following relation between moments of Xt and a holds: for any q > 

E{\Xt\'^) < oo for some (hence any) t>0 E{a'^) < oo . (2.4) 

(C) The process X can be represented as a stochastic volatility model: 

dXt = vtdBt, (2.5) 

where {Bt)t>o is a standard Brownian motion and {vt)t>o is an independent process, 
defined by (denoting I'{s) := g^-^(s)) 



Bt := / , dWi(s) = / , dWu , 

Jo ^ ^ Jo vW^) (2.6) 



vt := ^/Tit) = ^Gi^t) {t - n^t))' 

Note that, whenever D < ^, the volatility vt has singularities at the random times r„. 

(D) The process X is a zero-mean, square-integrable martingale (provided E{a^) < oo). 

Remark 2. If we look at the process X for a fixed realization of the variables T and S, 
averaging only on — that is, if we work under the conditional probability P( • |T, S) — 
the increments of X are no longer stationary, but the properties ([C]) and ( [Pj ) continue to 
hold (of course, condition E^a"^) < oo in (|D|) is not required under P{ ■ \T, S)). 



Remark 3. It follows from (|2.4|) that if a is chosen as a deterministic constant, then 

in 



Xt admits moments of all order (actually, even exponential moments, cf. Proposition 11 
sectionjo]). This seems to indicate that to see power-law tails in the distribution of (Xt-^-h—Xt) 
— one of the basic stylized facts mentioned in the introduction — requires to take a with 
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power-law tails. This, however, is not true, and is one on the surprising features of the simple 
model we propose: for typical choices of the parameters of our model, the distribution of 
— Xt) displays a power-law tail behavior up to several standard deviations from 
the mean, irrespective of the law of a. Thus, the eventually light tails are "invisible" for 
all practical purposes and real heavy-tailed distributions appear to be unnecessary to fit 
data. We discuss this issue below, cf. Remark [5} after having stated some results; see also 
subsection 2.4 and Figure '^[(b) for a graphical comparisons with the DJIA time series. 



Another important property of the process X is that its increments are mixing, as we 
show in section [6| This entails in particular that for every 5 > 0, k G N and for every choice 
of the intervals (oi, . . . , (a^, bk) ^ (0, oo) and of the measurable function F : 
we have almost surely 



pk 



N-1 



71=0 ^ ■ ' 

= E[F{Xi,^ — Xa^, . . . , Xftj, — Xa^.)] , 

provided the expectation appearing in the right hand side is well defined. In words: the 
empirical average of a function of the increments of the process over a long time period is 
close to its expected value. 

Thanks to this property, our main results concerning the distribution of the increments of 
the process X, that we state in the next subsection, are of direct relevance for the comparison 
of our model with real data series. Some care is needed, however, because the accessible time 



length N in (2.7) may not be large enough to ensure that the empirical averages are close 
to their limit. We elaborate more on this issue in section [7j where we compare our model 
with the DJIA data from a numerical viewpoint. 

2.3. The main results. We now state our main results for our model X, that correspond to 
the basic stylized facts mentioned in the introduction: diffusive scaling and crossover of the 
log- return distribution (Theoremj4]) ; multiscaling of moments (Theorem |6] and Corollary [T]) ; 
clustering of volatility (Theorem |8j and Corollary [9]) . 

Our first result, proved in section |4| shows that the increments (Xt-^-h — Xt) have an 
approximate diffusive scaling both when /i J, 0, with a heavy-tailed limit distribution (in 



agreement with (1.3)), and when h t oo, with a normal limit distribution. This is a precise 
mathematical formulation of a crossover phenomenon in the log-return distribution, from 
approximately heavy-tailed (for small time) to approximately Gaussian (for large time). 

Theorem 4 (Diffusive scaling). The following convergences in distribution hold for any 
choice of the parameters D, A and of the law v of a. 

• Small-time diffusive scaling: 

[Xt+h-Xt) ^ ^^^^^^ ^^^^^ (V2DA5-^)a5^-5l^i, (2.8) 

where a ^ v, ~ Exp[l) and Wi ~ AA(0, 1) are independent random variables. The 
density f is thus a mixture of centered Gaussian densities and, when D < ^, has 
power-law tails.- more precisely, if E{a^) < oo for all q > 0, 

J \x\'if{x)dx <+oo ^ K^l* ■= nljjy (2.9) 
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Large-time diffusive scaling: if E{a'^) < oo 



t+h 



-a;2/(2c2) 



dx = 7V(0, (? 



A 



^(cj2)r(2L» + l), (2.10) 



where T{a) := x"~^e~^dx denotes Euler's Gamma function. 

Remark 5. We have already observed that, when a has finite moments of all orders, for 
h > the increment (Xf^h — Xf) has finite moments of all orders too, cf. (2.4), so there are 
no heavy tails in the strict sense. However, for h small, the heavy-tailed density f{x) is by 
(2.8) an excellent approximation for the true distribution of -^{Xt+h — Xt) up to a certain 
distance from the mean, which can be quite large. For instance, when the parameters of our 
model are calibrated to the DJIA time series, these "apparent power-law tails" are clearly 
visible for h = 1 (daily log-returns) up to a distance of about six standard deviations from 
the mean, cf. subsection 2.4 and Figure '^[(b) below. 

We also note that the moment condition (2.9) follows immediately from (2.8): in fact, 
when a has finite moments of all orders. 



J \x\'if{x)dx <+oo ^ ^[5(^-1/2)9] = y 



' ds < +00 . 



(2.11) 



which clearly happens if and only if q < q* := (2 — D)^ . This also shows that the heavy 
tails of / depend on the fact that the density of the random variable S, which represents 
(up to a constant) the distance between points in T, is strictly positive around zero, and 
not on other details of the exponential distribution. 

The power- law tails of / have striking consequences on the scaling behavior of the moments 
of the increments of our model. If we set for q G (0, 00) 



mg{h) := Ei\Xt+h - Xt\i) 



(2.12) 



the natural scaling mq{h) ~ h'^^'^ as /i 4- 0, that one would naively guess from (2.8), breaks 
down for q > q*, when the faster scaling mq{h) ~ h^'^^^ holds instead, the reason being 
precisely the fact that the (7- moment of / is infinite for q > q* . More precisely, we have the 
following result, that we prove in section [4] 

Theorem 6 (Multiscaling of moments). Let q > 0, and assume E {a'') < +cxd. The quantity 
mq{h) in (2.12[) is finite and has the following asymptotic behavior as /i ^ 0; 



mq{h) 



Cqh2 



Cqhi\0g{\) 



ifq<q* 

ifq = q* 
ifq>q* 



where q* 



D) 



The constant Cg € (0, 00) is given by 

' E{\Wi\i) E{a'}) X'^/i' (2L»)'?/2r(l -g/g*) 
£;(|VFi|3)S(cj9) A(2D)9/2 



E{\WrnE{a'^)X[j;;^{{l + x 



,2D 



2 dx + 



Dq+1. 



ifq< 
'ifq = 
ifq> 



(2.13) 



where T{a) := Jq°° ^'dx denotes Euler's Gamma function. 
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Corollary 7. The following relation holds true: 

Aiq) :=lim ^°g"^f^ = |2 ' - '\ where g* := . (2.14) 

[Dq+l ^fq>q* " ^) 



The exphcit form (2.13) of the multiplicative constant Cq will be used in section [7] for the 



estimation of the parameters of our model on the DJIA time series. 

Our last theoretical result, proved in section [5| concerns the correlations of the absolute 
value of two increments, usually called volatility autocorrelation. We start determining the 
behavior of the covariance. 

Theorem 8. Assume that E{a'^) < oo. The following relation holds as h ].0, for all s,t > 0: 

Cov{\X,+h - Xs\, \Xt+h - Xt\) = — e-^l*-^l (</>(A|t -s\)h + o{h)) , (2.15) 

TT 



where 



(x) := Cov{a S""-'/^ , a {S + xf'^^^) (2.16) 



and S ~ Exp(l) is independent of a. 

We recall that p{Y, Z) := Cov{Y, Z) / ^yVar{Y)Var{Z) is the correlation coefficient of two 
random variables Y, Z. As Theorem [6] yields 

liml Vari\Xt+h-Xt\) = (20) X'-^"" Var{a \Wi\ S""-'/^) , 

hiO 11 

where S ~ Exp{l) is independent of a, Wi, we easily obtain the following result. 

Corollary 9 (Volatility autocorrelation). Assume that E{a'^) < oo. The following relation 
holds as h ^0, for all s,t > 0: 

hm p{\X,+h-Xs\,\Xt+h-Xt\) 



where is defined in (2.16) and a ^ u, S ^ Exp{l), Wi ~ A/'(0, 1) are independent 
random variables. 

This shows that the volatility autocorrelation of our process decays exponentially fast for 
time scales larger than the mean distance 1/A between the epochs r^. However, for shorter 
time scales the relevant contribution is given by the function </)(•). By ( |2.16| ) we can write 

(t>{x) = Var{a) E{S^-^I'^ {S + x)^-^l^) + E{af Cov{S^-^/^, {S + x)^-^/^) , (2.18) 

where S ~ Exp{l). When D < ^, as x — t- oo the two terms in the right hand side decay as 

EiS""''/^ {S + X)^-V2) ^ ci C0V[S''-^I\ {S + X)^-V2) ^ ,2 (2.19) 

where ci,C2 are positive constants, hence (/)(x) has a power-law behavior as x — )• oo. For 
X = 0(1), which is the relevant regime, the decay of (/'(x) is, roughly speaking, slower than 
exponential but faster than polynomial (see Figures |5||[b)| and ^[(c) ) . 
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(a) Multiscaling in the DJIA 
(1935-2009). 




400 




(b) Volatility autocorrelation 
in the DJIA (1935-2009): log 
plot. 



400 



(c) Volatility autocorrelation 
in the DJIA (1935-2009): 
log-log plot. 



Figure 3. Multiscaling of moments and volatility autocorrelation in the DJIA time 
seri es (1935-2009), compared with our model. 



(a) The DJIA empirical scaling exponent A{q) (circles) and the theoretical scaling 



exponent A{q) (line) as a function of q. 



(b) Log plot for the DJIA empirical 1-day volatility autocorrelation pi{t) (circles) 



and the theoretical prediction p{t) (line), as functions of t (days). For clarity, only 
one data out of two is plotted. 



(c) Same as (b) but log-log plot instead of log plot. For clarity, for i > 50 only one 



data out of two is plotted. 



2.4. Fitting the DJIA time series. We now consider some aspects of our model from a 
numerical viewpoint. More precisely, we have compared the theoretical predictions and the 
simulated data of our model with the time series of some of the main financial indexes (DJIA, 
S&P 500, FTSE 100 and Nikkei 225), finding a very good agreement. Here we describe in 
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Figure 4. Distribution of daily log-returns in the DJIA time series (1935-2009), 
compared with our model (cf. subsection 7.3 for details). 



(a) The density of the DJIA log-return empirical distribution pi(-) (circles) and the 



theoretical prediction pi(-) (line). The plot ranges from zero to about three standard 
deviations (s ~ 0.0095) from the mean. 



(b) Log-log plot of the right and left integrated tails of the DJIA log-return empirical 



distribution pi(-) (circles and triangles) and of the theoretical prediction pi(-) (solid 
line) . The plot ranges from one to about twelve standard deviations from the mean. 
Also plotted is the asymptotic density /(•) (dashed line) defined in equation (|2.8[). 



detail the case of the DJIA time series over a period of 75 years: we have considered the 
DJIA opening prices from 2 Jan 1935 to 31 Dec 2009, for a total of 18849 daily data. 

The four real parameters D, X, E{a), E{(t'^) of our model have been chosen to optimize 
the fitting of the scaling function A{q) of the moments (see Corollary [7]), v^^hich only depends 
on D, and the curve p{t) of the volatility autocorrelation (see Corollary [9]) , which depends 
on D, A, E{a), E{a'^) (more details on the parameter estimation are illustrated in section [?]). 
We have obtained the following numerical estimates: 



D ~ 0.16, A ~ 0.00097, E{a) ~ 0.108, 



0.0117- {E{a)y 



(2.20) 



Note that the estimated standard deviation of a is negligible, so that a is "nearly constant" . 
We point out that the same is true for the other financial indexes that we have tested. In 
particular, in these cases there is no need to specify other details of the distribution v oi a 



and our model is completely determined by the numerical values in (2.20) 



As we show in Figure [3j there is an excellent fitting of the theoretical predictions to 
the observed data. We find remarkable that a rather simple mechanism of (relatively rare) 
volatilty shocks can account for the nontrivial profile of both the multiscaling exponent A[q), 
cf. Figure ^Ka)[ and the volatility autocorrelation p{t), cf. Figure ^[(b) 



(c) 



Last but not least, we have considered the distribution of daily log-returns: Figure |4] 
compares both the density and the integrated tails of the log-return empirical distribution, 
cf. (1.2), with the theoretical predictions of our model, i.e., the law of Xi. The agreement 



is remarkable, especially because the empirical distributions of log-returns was not used for 
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the calibration the model. This accuracy can therefore be regarded as structural property of 
the model. 

In Figure ^[(b) we have plotted the density of Xi, represented by the red solid line, and 



the asymptotic limiting density / appearing in equation (2.8) of Theorem [4| represented by 



the green dashed line. The two functions are practically indistinguishable up to six standard 
deviations from the mean, and still very close in the whole plotted range. We stress that / 
is a rather explicit function, cf. equation (2.8). Also note that the log- log plot in Figure ^[b 



shows a clear power-law decay, as one would expect from / (the eventually light tails of Xi 
are invisible). 

Remark 10. We point out that, even if we had found Var(a) := E{a'^) — {E{a))'^ > 
(as could happen for different assets), detailed properties of the distribution of a are not 
expected to be detectable from data — nor are they relevant. Indeed, the estimated mean 
distance between the successive epochs (t„)„>o of the Poisson process T is 1/A ~ 1031 days. 



cf. (2.20). Therefore, in a time period of the length of the DJIA time series we are considering, 
only 18849/1031 ~ 18 variables ak are expected to be sampled, which is certainly not enough 
to allow more than a rough estimation of the distribution of a. This should be viewed more 
as a robustness than a limitation of our model: even when a is non-constant, its first two 
moments contain the information which is relevant for application to real data. 



3. Discussion and further developments 

Now that we have stated the main properties of the model, we can discuss more in depth 
its strength as well as its limitations, and consider possible generalizations. 

3.1. On the role of parameters. A key feature of our model is its rigid structure. Let 
us focus for simplicity on the case in which a is a constant (which, as we have discussed, 
is relevant for financial indexes). Not only is the model characterized by just three real 
parameters D, A, a: the role of A and a is reduced to simple scale factors. In fact, if we 
change the value of A and a in our process {Xt)t>o, keeping D fixed, the new process has 
the same distribution as {aXbt)t>o for suitable a,b (depending of X,a), as it is clear from 



the definition (2.2) of {It)t>o- This means that D is the only parameter that truly changes 
the shape (beyond simple scale factors) of the relevant quantities of our model, as it is 
also clear from the explicit expressions we have derived for the small-time and large-time 
asymptotic distribution (Theorem |4]) , mulstiscaling of moments (Theorem [g]) and volatility 
autocorrelation (Theorem |8]) . 

More concretely, the structure of our model imposes strict relations between different 
quantities: for instance, the moment q* = (^ — D)~^ beyond which one observes anomalous 



scaling, cf. (2.14), coincides with the power-law tail exponent of the (approximate) log-return 



distribution for small time, cf. (2.9), and is also linked (through D) to the slow decay of 



the volatility autocorrelation from short to moderate time of the, cf. (2.18) and (2.19). The 
fact that these quantitative constraints are indeed observed on the DJIA time series, cf. 
Figures [3] and |4j is not obvious a priori and is therefore particularly noteworthy. 

3.2. On the comparison with multifractal models. As we observed in the introduction, 
the multiscaling of moments is a key feature of multifractal models. These are random time- 
changes of Brownian motion Xf = Wj^, like our model, with the important difference that 
the time-change process {It)t>o is rather singular, having non absolutely continuous paths. 
Since in our case the time-change process is quite regular and explicit, our model can be 
analyzed with more standard and elementary tools and is very easy to simulate. 
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A key property of multifractal models, which is at the origin of their multiscahng features, 
is that the law of Xt has power-law tails, for every i > 0. On the other hand, as we already 
discussed, the law of Xt in our model has finite moments of all orders — at least when 
E{a'^) < oo for every g > 0, which is the typical case. In a sense, the source of multiscahng 
in our model is analogous, because (approximate) power-law tails appear in the distribution 
of Xt in the limit t ^ 0, but the point is that "true" power-law tails in the distribution of Xt 
are not necessary to have multiscahng properties. 

We remark that the multiscahng exponent A{q) of our model is piecewise linear with 
two different slopes, thus describing a biscaling phenomenon. Multifractal models are very 
flexible in this respect, allowing for a much wider class of behavior of A{q). It appears 
however that a biscaling exponent is compatible with the time series of financial indexes (cf. 



also Remark 14 below). 



We conclude with a semi-heuristic argument, which illustrates how heavy tails and mul- 



tiscahng arise in our model. On the event {(— tq) < h, ti > h} we can write, by (2.2), 
h = aU{h - ro)2^ - (-ro)2^} > /i^^ and therefore \Xh\ = \Wi,\ ~ Vh\Wi\ >^ . 
Consequently we get the bound 

P{\Xh\ > h"") > P((-ro) <h,n>h)>h, (3.1) 

which allows to draw a couple of interesting consequences. 

• Relation ^ yields the lower bound EdXhl") > h^iP{\Xh\ > h^) > h^''+^ on the 
moments of our process. Since Dq + 1 < q/2 for q> q* = {\ — D)~^, this shows that 
the usual scaling E{\Xh\'^) ~ /i'/^ cannot hold for q > q* ■ 

• Relation ( |3.1| ) can be rewritten as P{-^\Xh\ ^ t) > t^''* , where t = h^^2^^^ and 
q* = — D)^^ . Since t — )• +oo as /i | 0, when D < ^, this provides a glimpse of the 



appearance of power law tails in the distribution of X^ as /i | 0, cf. (2.8) and (2.9), 
with the correct tail exponent q* . 

3.3. On the stochastic volatility model representation. We recall that our process 



(Xt)t>o can be written as a stochastic volatility model dXt = vt dBt, cf. (2.5). It is interesting 
to note that the squared volatility (vt)^ is the stationary solution of the following stochastic 
differential equation: 

d{vl) = -at {vtY dt + oodi(t), (3.2) 

where we recall that {i(t))t>o is an ordinary Poisson process, while 7 is a constant and at is 
a piecewise-constant function, defined by 

_ 2D _ 1-2D 1 

^ ''~ 1-2D ^ ^' (2D)1/(1-2D) 2/(l-2D) > 0- 

i{t) 



We stress that (ft) is a pathwise solution of equation (3.2), i.e., it solves the equation for 
any fixed realization of the stochastic processes i{t) and at- The infinite coefficient of the 
driving Poisson noise is no problem: in fact, thanks to the superlinear drift term —at {vt)"' , 
the solution starting from infinity becomes instantaneously finitej^ 



The representation (3.2 ) of the volatility is also useful to understand the limitations of our 



model and to design possible generalizations. For instance, according to (3.2), the volatility 



has the rather unrealistic feature of being deterministic between jumps. This limitation 



could be weakened in various way, e.g. by replacing i{t) in (3.2) with a more general Levy 



^Note that the ordinary differential equation dx{t) = —ax{t)''dt with x{0) — 00 has the exphcit solution 



xit) = ct-i/'^-i' where c = c(a, 7) = (0(7 - l))-i/(7- 



1) 
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subordinator, and/or adding to the volatility a continuous random component. Such addition 
should allow a more accurate description of the intermittent structure of the volatility profile, 
in the spirit of multifractal models. 

In a sense, the model we have presented describes only the relatively rare big jumps of the 
volatility, ignoring the smaller random fluctuations that are present on smaller time scales. 
Besides obvious simplicity considerations, one of our aims is to point out that these big 



jumps, together with a nonlinear drift term as in (3.2), are sufficient to explain in a rather 



accurate way the several stylized facts we have discussed. 

3.4. On the skewness and leverage effect. Our model predicts an even distribution 
for Xt, but it is known that several financial assets data exhibit a nonzero skewness. A 
reasonable way to introduce skewness is through the so-called leverage effect. This can be 



achieved, e.g., by modifying the stochastic volatility representation, given in equations (2.5) 



and (3.2), as follows: 

dXt = vtdBt - /3di{t) 

dvf = —at {vf)"'dt + oodi(t), 

where /3 > 0. In other words, when the volatility jumps (upward), the price jumps downward 
by an amount 7. The effect of this extension of the model is currently under investigation. 

3.5. On further developments. A bivariate version {{Xt,Yt))t>Q of our model, where the 
two components are driven by possibly correlated Poisson point processes , , has been 
investigated by P. Pigato in his Master's Thesis p7]. The model has been numerically cal- 
ibrated on the joint time series of the DJIA and FTSE 100 indexes, finding in particular 
a very good agreement for the volatility cross-correlation between the two indexes: for this 
quantity, the model predicts the same decay profile as for the individual volatility autocor- 
relations, a fact which is not obvious a priori and is indeed observed on the real data. 

We point out that an important ingredient in the numerical analysis on the bivariate 
model is a clever algorithm for finding the location of the relevant big jumps in the volatility 
(a concept which is of course not trivially defined). Such an algorithm has been devised by 
M. Bonino in his Master's Thesis which deals with portfolio optimization problems in 
the framework of our model. 



4. Scaling and multiscaling: proof of Theorems [4] and [6] 



We observe that for all fixed t, /i > we have the equality in law X^^f^ — Xt ^ \/7/^ VFi, 
as it follows by the definition of our model {Xt)t>o = {WjJt>o- We also observe that i{h) = 
i^{T f] (0, /i]} ~ Po{Xh), as it follows from (2.1) and the properties of the Poisson process. 



4.1. Proof of Theorem |4[ Since P{i{h) > 1) = 1 



the event {i{h) = 0} = {T n (0, h] 
with —To ~ Exp[\). In particular. 



as /i I 0, we may focus on 
0}, on which we have h = (yl{{h - Tof^ - (-tq)^^). 



lim — 

hio h 



I'{0) = {2D)al{-T^ 



,2_D-1 



a.s. . 



Since XtJ 



Xt 



y/ThWi, the convergence in distribution (|2.8|) follows: 



Xt 



Vh 



^ao{-To)^-^/^Wi as/iiO. 
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Next we focus on the case h'l' oo. Under the assumption E{a'^) < oo, the random variables 
{(T|_^(rfc — Tk-i)'^^}k>i are independent and identically distributed with finite mean, hence 
by the strong law of large numbers 

hm - V al_,{n - rfc_i)2^ = E{a^)Ei{nf'') = E{a^) A-^^ T{2D + 1) a.s. . 

k=l 

Plainly, lini/i-s^+oo i{h)/h = A a.s., by the strong law of large numbers applied to the random 
variables {Tk}k>i- Recalling (2.2), it follows easily that 

hm ^ = E{a^) Ai-2^ r(2D + 1) a.s. . 

h^oo h 

Since Xt^^ — Xt ^ y/Th Wi , we obtain the convergence in distribution 

Xt+fe - Xt /E{a^) Ai-2g T{2D + 1) Wi as /i f oo , 

which coincides with ( 2.10[ ). □ 
4.2. Proof of Theorem [ej Since Xt+h — Xt ^ y/ThWi, we can write 

E{\Xt+^, - Xt\'i) = E{\h\'^/^\Win = E{\Win E{\h\'/') = CgE{\h\^/'') , (4.1) 

q 

where we set Cq := We therefore focus on E{\Ih\^), that we write as the sum of 

three terms, that will be analyzed separately: 

E{\h\l) = i?(|4|§ l{,(,)=o}) + Ei\h\' l«h)=i}) + Ei\h\'^ l{^ih)>2}) . (4.2) 

For the first term in the right hand side of (4.2), we note that P{i{h) = 0) = e~'^^ — t- 1 as 
/i i and that 4 = cjg((/i - tq)^^ - (-tq)^^) on the event {i{h) = 0}. Setting -tq =: X'^S 
with S ~ Exp{l), we obtain as /i ^ 

Ei\h\'^ 1{.W=0}) = Eia") A-^« E{{iS + Xhf^ - S^"")'^) (l + o(l)) . (4.3) 

Recalling that q* := (^ — D)^^, we have 

As (5 i we have 6-^{{S + - 5^^) t 2D S^^~^ and note that ^(S^^" 5)-?) = r(l -q/q*) 
is finite if and only if {D — ^)q > —1, that is g < q* . Therefore the monotone convergence 
theorem yields 

for q<q*: lim ^{{{3 + Xh)'''^- S^^)-) ^ ^^^^^/^ _ ^ ^ ^^^^^ 

HO Az /l2 

Next observe that, by the change of variables s = {Xh)x, we can write 



poo 

i?(((5 + A/i)2^-52^)i) = / {{s + Xhf''-s^y2e-^ds 

Jo 



(Xh) 



D,+i / ((l+x)2«-x2^)f e-^'^-'dx. 
Jo 



(4.5) 



Note that ((1 + x)^^ - x^^)2 ~ (2L»)f x^^'i^^ as x +oo and that {D - \)q < -1 if and 
only if q > q* . Therefore, again by the monotone convergence theorem, we obtain 



forq>q*: ^^^^^ = / ((1 + x)^^ -x2^)t dx G (0, oo) . (4.6) 
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Finally, in the case q = q* we have ((1 + — x^^Y*/'^ ~ {2Dy* /"^ as x ^ +00 and we 
want to study the integral in the second line of (4.5). Fix an arbitrary (large) M > and 

we 1 

(y_ 



note that, integrating by parts and performing a change of variables, as /i | we have 



dx 



M 



X 



'y 



POO 

/ (log 



x)e-^^''dx = 0(1) + 



f 00 

log , > 

l\hM 



= o(i)+ / 

From this it is easy to see that as /i | 

{{l + xf^ 



e ^ dy + log 



e ^ dy 



XhM 



poo 






log 


JxhM 








= log 







(1+0(1)). 



dx ~ (2D) ^ log - 
h 



Coming back to (4.5), noting that Dq + 1 = i for q = q* , it follows that 



lim 



i?(((5 + /l)2^-52^)V) 



hio A^9*+i/iV iog(i; 



(2D) 2 . 



(4.7) 



Recalling (4.1) and (4.3), the relations (4.4), (4.6) and (4.7) show that the first term in the 
right hand side of ( |4.2[ ) has the same asymptotic behavior as in the statement of the theorem, 
except for the regime q > q* where the constant does not match (the missing contribution 
will be obtained in a moment). 

We now focus on the second term in the right hand side of (4.2). Note that, conditionally 
on the event {i{h) = 1} = {ri < h,T2 > h}, we have 



4 = af{h-nf^Wo{{n-rof^-{-roD ~ af{h-hUf^+al(^(^hU+^^ 



2D 



2D 



where S ~ Exp{l) and ?7 ~ C/(0, 1) (uniformly distributed on the interval (0, 1)) are inde- 
pendent of (To and ai. Since P{i{h) = 1) = Xh + o{h) as /i J, 0, we obtain 



E{\h\il{^^H)=i}) = Xh^''^^E 
Since {u + x)^^ — x'^^ — )■ as x 



S_ 
Xh 



2D 



S_ 

Xh 



2Dx f 



(4.8) 



00, for every u > 0, by the dominated convergence 



theorem we have (for every q G (0, 00)) 
i?(|4|il|,(;,)=i|) 



lim 
^4,0 



XE{al)E{il-U)''^) = XE{al) 



1 



Dq + l 



(4.9) 



This shows that the second term in the right hand side of ( |4.2[ ) gives a contribution of the 
order h^'^^^ as /i J, 0. This is relevant only for q > q*, because for q < q* the first term gives 
a much bigger contribution of the order /i'?/'^ (see (4.4) and (4.7)). Recalling (4.1), it follows 
from (4.9) and (4.6) that the contribution of the first and the second term in the right hand 
side of (4.2) matches the statement of the theorem (including the constant). 

It only remains to show that the third term in the right hand side of (14. 2|) gives a negligible 



contribution. We begin by deriving a simple upper bound for I^. Since 
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for all a,b > (we recall that 2D < 1), when i{h) > 1, i.e. ri < h, we can write 

iih) 

h = 4.)(/. - r,(,))2^ + aU{r, - n..^ + ^'o [in - ro f " ("To)^^] 

^ ^2 //, ^ , 2 / ^ \2D , 2^2D 

k=2 

where we agree that the sum over k is zero if i{h) = 1. Since < h for all k < i{h), by 

the definition ( |2.1| ) of i{h), relation ( |4.10[ ) yields the bound Ih < h?^ 1^1=0 ffc, which holds 
clearly also when i{h) = 0. In conclusion, we have shown that for all h,q > 

i{h) \ g/2 



\ k=0 / 



(4.11) 



Consider first the case q > 2: hy Jensen's inequality we have 



\k=0 / \ ^ ' k=0 / k=0 

By ( |4.11| ) and ( |4.12[ ) we obtain 

ii;(|/,r/2i|,(^)>2}) < /i^'^i^(a5)ii;((i(/i) + 1)^/2 

A corresponding inequality for g < 2 is derived from (4.11 ) and the inequality (X^fcLi ^kY^"^ ^ 
X^fcli ^fe^^i which holds for every non-negative sequence (x„)„gN: 



(4.12) 



(4.13) 



/ i{h) \ 

£;(|4r/2l^,(,)>2}) < h^'Ei E^fcl{iW>2} < /i^'^i5^(a'^)i?((^(/i) + l)l«/.)>2}] 

\fc=0 / 



(4.14) 
h<l 

(4.15) 



For any fixed a > 0, by the Holder inequality with p = 2> and p' = 3/2 we can write for h <1 

E{m + iri{,(H)>2}) < E{{i{h) + if'^f"'p{i{h)>2fi^ 

< E{{i{l) + if^y^^l - e-^'' - e-^''Xhf/^ < {const.) h^/^ , 

because E{{i{l) + if'') < oo ( recal l that ijh) Poj X)) a nd (1 - e'^'' - e'^^A/i) ~ |(A/i)2 
as /i i 0. Then it follows from ( |4.13[ ) and ( |4.14| ) and ( |4.15[ ) that 

i5;(|4r/'l{.(„)>2}) < (const.') 

This shows that the contribution of the third term in the right hand side of (4.2) is always 
negligible with respect to the contribution of the second term (recall (4.9)). □ 



5. Decay of correlations: proof of Theorem [8] 

Given a Borel set / C R, we let Gi denote the cr-algebra generated by the family of random 
variables {Tk'i-{Tf,ei}j'^k^{Ti.£i})k>o- Informally, Qf may be viewed as the a-algebra generated 
by the variables Tk, (Jk for the values of k such that e /. From the basic property of the 
Poisson process and from the fact that the variables (o-fc)fc>o are independent, it follows that 
for disjoint Borel sets /, /' the a-algebras Qj, Qp are independent. We set for short Q := Q^, 
which is by definition the cr-algebra generated by all the variables {Tk)k>o and (<Tfc)fc>o, which 
coincides with the u-algebra generated by the process {It)t>o- 
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We have to prove (2.15). Plainly, by translation invariance we can set s = without loss 



of generality. We also assume that h < t. We start writing 
Cov{\Xh\,\Xt+h-Xt\) 

= Cov{E{\Xh\\g),E{\Xt+h-Xt\\g)) + E{Cov{\Xhl\Xt+h- Xt\\g)) . 



(5.1) 



We recall that Xt = Wj^ and the process {It)t>o is ^-measurable and independent of the 
process {Wt)t>o- It follows that, conditionally on {It)t>o, the process {Xt)t>o has indepen- 



dent increments, hence the second term in the right hand side of (5.1) vanishes, because 



Cov{\Xh\,\Xt+h — Xt\\G) = a.s.. For fixed h, from the equality in law Xh = Wj,^ ~ 
y/ThWi it follows that E{\Xh\\g) = ci^/h, where ci = ^(|VFi|) = y^2/Tr. Analogously 



E{\Xt+h - Xt\\g) = y^y^It+h - It and ^ reduces to 

2 



Cov{\Xh\,\Xt+h-Xt 



vr 



Cov{y/Th, It+h - It 



(5.2) 



Recall the definitions (2.1) and (2.2) of the variabes i{t) and It. We now claim that we 
can replace -s/lt+h-^t by ^/h+h — It l{Tn(/i,t]=0} iii (5.2). In fact from (2.2) we can write 



i{t+h) 

It+h - It = '^i{t+h)ii + ^ - Ti{t+h)f^ + X] ^k-ii^k - Tk-if^ - a^^^■^{t - Ti^t)f^ , 

k=i{t)+l 

where we agree that the sum in the right hand side is zero if i{t + h) = i{t). This shows 
that {It+h — It) is a function of the variables Tk,(Jk with index i[t) < k < i{t + h). 
Since {T n {h,t\ / 0} = {ri(t) > h}, this means that y%^^i^^^^t'i-{rn{h,t]^d} is g{h,t+h]- 
measurable, hence independent of \/Th, which is clearly /jj-measurable. This shows 

that Cov{\/Th, \/It+h — It l{Tn(/i,t]7^0}) = Oi therefore from (5.2) we can write 



Cov{\Xh\,\Xt+h- Xt\) = - Cov{yfTh, It+h - It l{rn(fe,i]=0}) • 
Now we decompose this last covariance as follows: 



(5.3) 



Cov{y^, ^Jlt+h - It'i- 



{Tn(h,t]=d} 



E 



h-E{^h))^It+h-Itl 



{rn(fc,t]=0} 



E 



+ E 



h-E{^/h)) ^It+h-Itl 



{rn(o,t+h]=d} 



(5.4) 



Ih-E{^)) y^It+h - It 1 



{rn(/i,t]=0}-'-{rn([o,/i]u(i,t+/i])7^0} 



We deal separately with the two terms in the r.h.s. of (5.4). The first gives the dominant 



contribution. To see this, observe that, on {Tn (0,t + h] = $} 

h = 4 [{h - Tor - {-ro?""] 

and 

It+h -It = cjI [{t + h- Tof -it- ro)2^] . 
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Since both al [{h - to)^^ - (-ro)^^] and erg [{t + h - Tof'^ - {t - ro)^^] are independent 
of {T n (0, t + /i] = 0}, we have 



E 



{rn(o,i+/i]=0} 



E 



^oV(/i-To)2^-(-ro)2^-i?(A/4)) ao^(t + h - ro)2^ - (t - ro)2^ l{rn(o,i+h]=0} 



-A(t+/i)£, 



+ 



CTo./(/l-To)2«-(-ro) 



2_D 



i^(^ao^(t + /i-ro)2^-(t-ro)2^ 

(5.5) 

Since + x)^^ — x^^) f 2Dx^^~^ as 6 \,0, by monotone convergence we obtain 



(5.6) 



hm -Cov ao\ {h - r^Y^ _ (_^^)2D^^ /(^ + /, _ ^^yn _ _ ^^)2D 



2DCov (ao(-ro)^-i/2^ao(t - to)^-!/^ 



2i:)A 



1-2D 



ct>{Xt), 



with S := A(— To) ~ Exp{l) and is defined in (2.16). Similarly 
1 



lim ^E aoxit + h- toY^ _ _ yo 



^ = ^/2D£;(c7o(^-ro)^"^/^) < +00. (5.7) 



Therefore, if we show that 



lim E 



) = V2DE (ao(-ro)^-V^) 



{51 



using (5.5), (5.6), (5.7), we have 

lim {\fh - E^/h^ a/ h+h - It l{rn(o,t+?i] = 0} 



2DX 



1-2D 



(j){Xt) (5.9) 



To complete the proof of ( |5.9[ ), we are left to show (5.8). But this is a nearly immediate 
consequence of Theorem [6j indeed, using ( 2.13| ) and the fact that q* > 1, 

Ey/h = ]— E{\Xh\) = r-^Vh + o{Vh) = V2DE(ao{-To)^-^/^)Vh + o{Vh). 



EyWA 



E\Wi\ 



The proof is now completed if we show that the second term in (5.4) is negligible, i.e. it is 
o{h). By Cauchy-Schwarz inequality and the simple fact that [■\/Th — E^/Th) ^ Ih + E{Ih) 



h — Ey/lh) \J h+h — h 1 



< E 



{rn(/i,t]=0}-'-{rn((o,/i]u(i,t+/i])^0} 



P(rn((o,/i]u(t,t + /i]) /0) 



1/2 



'h-E^Jh) {h+h -It) 

< { E [{h + E{h)) {h+h - h)] P{Tn ((0, h] u{t,t + h]) / 0))i/2 

< {2E[ll]P{Tn{{0,h]U{t,t + h])^^)f' = {2E[ll]f'V2Xh. 



(5.10) 
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By Theorem [oj E [/^] is of order /i^ if 4 < g*, and of order if 4 > g*, with a logarithmic 

correction for q* = 4. In both cases (-E [/^] ) ^^'^ ^2\h = o{h) , and the proof is completed. □ 



6. Basic properties of the model 



In this Section we start by proving the properties (^A|-(^B^-(^C|-(^D^ stated in section 2.2 



Then we provide some connections between the tails of a and those of Xt, also beyond the 
equivalence stated in (2.4). Finally, we establish a mixing property that yields relation (2.7). 
One of the proofs is postponed to the Appendix. 

We denote by Q the cr-field generated by the whole process {It)t>Oi which coincides with 
the c7-field generated by the sequences T = {Tk}k>o and S = {crk}k>o- 



h. As before, let r^^^^ be the largest 



Proof of property ([A]). We first focus on the process {It)t>o, defined in (|2.2|). For /i > let 
■.= T — h and denote the points in by = 

point of smaller that t, i.e., i^{t) 

+ E 

fc=i'>{0)+l 



H+h 



2 



t 



2D 



i{t + h). Recalling the definition (2.2), we can write 



2 



2D 



2D 



where we agree that the sum in the right hand side is zero if i^{t) = i^{0). This relation 
shows that {It+h — Ih)t>o and {It)t>o are the same function of the two random sets and 
T. Since and T have the same distribution (both are Poisson point processes on M with 
intensity A), the processes {It+h — Ih)t>o and {It)t>o have the same distribution too. 

We recall that G is the cr-field generated by the whole process {It)t>o- From the definition 
Xt = Wj^ and from the fact that Brownian motion has independent, stationary increments, 
it follows that for every Borel subset A C m[0'+°°) 

P {Xh+. -XheA) = E[P {Wi,^, -Wi^eA\g)]=P {Wi. eA) = P{X.€A), 

where we have used the stationarity property of the process /. Thus the processes {Xt)t>o and 
{Xh+t— Xh)t>o have the same distribution, which implies stationarity of the increments. □ 

Proof of property (0. Note that E{\Xt\i) = E{\Wi^\i) = E{\It\'i/'^) E{\Wi\i), by the inde- 
pendence of W and / and the scaling properties of Brownian motion. We are therefore left 
with showing that 

E(|It|9/2)<oo ^ ^(a^)<oo. (6.1) 

The implication 

\2D 



>" is easy: by the definition (2.2) of the process I we can write 



E{<r',)E{\{t 



oo. 



therefore if E{a'') = oo then also E{\It\'^/'^) 

The implication "<^" follows immediately from the bounds (4.13) and (4.14), which hold 
also without the indicator 1 



L{i(h)>2}- 



□ 



Proof of property ([C]). Observe first that I'g := g^/g > a.s. and for Lebesgue-a.e. s > 
By a change of variable, we can rewrite the process {Bt)t>o defined in (2.6) as 



1 



Vl'il-'i^)) 



1 



1 



dX, 
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which shows that relation (2.5) holds true. It remains to show that {Bt)t>o is indeed a 
standard Brownian motion. Note that 

[•It 

Bt= / ^/{I-^y{u)dWu. 







Therefore, conditionally on Q (the ci-field generated by {It)t>o)-, {Bt)t>o is a centered Gauss- 
ian process — it is a Wiener integral — with conditional covariance given by 

/■mm{7s,/t} 

Cov{Bs,Bt\G) = I {r^)'{u)du = mm{s,t} . 

Jo 

This shows that, conditionally on G, (Bt)t>o is a Brownian motion. Therefore, it is a fortiori 
a Brownian motion without conditioning. □ 



Proof of property p3|). The assumption E{a ) < oo ensures that ) < oo for all 

t > 0, as we have already shown. Let us now denote by J-^ = a{Xs, s < t) the natural 
filtration of the process X. We recall that G denotes the a- field generated by the whole 
process {It)t>o and we denote by J-^ V G the smallest cj-field containing and G- Since 
-E-lW^/t+h ~ ^it\-^i^ V ^) = for all h > 0, by the basic properties of Brownian motion, 
recalling that Xf = Wj^ we obtain 

J-/^ yG)=Xt + E(Wi^^, - Wi,\Tf V = Xt. 

Taking the conditional expectation with respect to on both sides, we obtain the mar- 
tingale property for {Xt)t>o- D 

Let us state a proposition, proved in Appendix [Aj that relates the exponential moments 
of a to those of Xt. We recall that, when our model is calibrated to real time series, like the 
DJIA, the "observable tails" of Xt are quite insensitive to the details of the distribution of 
cr, cf. Remarks [3] and [5} 

Proposition 11. Regardless of the distribution of a, for every q > {1 — D)^^ we have 

E [exp {'y\Xt\'i)] = 00 , Vt > , V7 > . (6.2) 
On the other hand, for all q < (1 — D)^^ and t > we have 

^ [exp (71X41")] < 00 V7>0 [exp (ao-^)l < 00 Va > , (6.3) 



and the same relation holds for q = {I — D) ^ provided D < ^. 

Note that (1 - D)'^ £ (1,2], because D € (0, j], so that for D < ^ the distribution of Xt 
has always tails heavier than Gaussian. 

We finally show a mixing property for the increments of our process. In what follows, for 
an interval / C [0, +00), we let 

■.= a{Xt- X,: s,tG I) 

to denote the cr-field generated by the increments in / of the process X. 

Proposition 12. Let I = [a,b), J = [c, d), with 0<a<b<c<d. Then, for every A G 
and B £ Tf 

\P{A r\B)- P{A)P{B)\ < e-^(^-^) . (6.4) 

As a consequence, equation (2.7) holds true almost surely and in , for every measurable 
function F ■.R'' such that E[\F{Xb^ - Xa^^, . . . , Xb^ - XaJ\] < +00. 
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Proof. We recall that T denotes the set {r^ : k £ Z} and, for / C M, ^j- denotes the cr-algebra 
generated by the family of random variables (Tfclj^^g/}, cjfcl|^^g/j,)fc>o, where (cr A;)fc>o is the 
sequence of volatilities. We introduce the ^[f, c)-measurable event 

T:={Tn[b, c)/0}. 

(We recall that the o"-field Qj was defined at the beginning of section [sj) We claim that, for 
we have 

p{An B nr) = P{A)P{B nr). (6.5) 

To see this, the key is in the following two remarks. 

• and Tf are independent conditionally on Q = Gr- This follows immediately from 
the independence of W and (It). As a consequence, P{AnB\g) = P{A\g)P{B\g) a.s.. 

• Conditionally to Q, the family of random variables {Xt — ^s)s,te[c,d) is a Gaussian 
process whose covariances are measurable with respect to the fi-field generated by 
the random variables {It — Ic ■ t £ (c, d)}. In particular, P{B\Q) is measurable with 
respect to this cr-field. Similarly for [a, b) in place of [c, d). Note also that the increment 
If — Ic IS a measurable function of the random variables 

{(Tfcl|^^g/},o-fcl{^^67}) : > 0} U {(cri(c),ri(c))}. 

It follows that the random variable (P(B\Q)lr) is G(b,d) measurable, and it is therefore 
independent of P{A\Q), which is G(-oo,b] measurable. 

Thus we have 

p{A n s n r) = e{p{a n B\g)ir) = E{P{A\g)P{B\G)ir) = p{A)p{b n r) 



where the two remarks above have been used. Thus (6.5) is established. Finally 

\P{Ar\B)-P{A)P{B)\ 

= \P{A n B n r) + p{A n 5 n - p{A)p{b n r) - p{A)p{b n n| 

= \P{A n B n r") - p{A)p{b n r)| = \p{A n b\t^) - p{a\t^)p{b\t^)\p{t^) 



We finally show that equation (2.7) holds true almost surely and in L^, for every measur- 
able function F ■.R'' such that E[\F{Xi,^ - Xa^,.. .^X^^ - X„J|] < +00. Consider the 
M*^-valued stochastic process ^ = (^n)neN defined by 

'■— {,Xn5-\-b\ -^nS+ai ; • • • i -^nS+bk -^nS+ak ) i 

for fixed 5 > 0, k £ N and (oi, 61), . . . , (a^, bk) ^ (0, 00). The process ^ is stationary, because 
we have proven in section [6] that X has stationary increments. Moreover, inequality (6.4) 
implies that ^ is mixing, and therefore ergodic (see e.g. |30j . Ch. 5, §2, Definition 4 and 
Theorem 2). The existence of the limit in ( |2.7[ ), both a.s. and in L^, is then a consequence 
of the classical Ergodic Theorem, (see e.g. [30], Ch. 5, §3, Theorems 1 and 2). □ 



7. Estimation and data analysis 
In this Section we present the main steps that led to the calibration of the model to the 



DJIA over a period of 75 years; the essential results have been sketched in Section 2.4, We 



point out that the agreement with the S&P 500, FTSE 100 and Nikkei 225 indexes is very 
good as well. A systematic treatment of other time series, beyond financial indexes, still has 
to be done, but some preliminary analysis of single stocks shows that our model fits well 
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some but not all of them. It would be interesting to understand which of the properties we 
have mentioned are linked to aggregation of several stock prices, as in the DJIA. 

The data analysis, the simulations and the plots have been obtained with the software 



R [21]. The code we have used is publicly available on the web page http: //www.matapp. 
unimib . it/~f caraven/c .html[ 



7.1. Overview. For the numerical comparison of our process {Xt)t>o with the DJIA time 
series, we have decided to focus on the following quantities: 

(a) The multiscaling of moments, ci. Corollary [Tj 

(b) The volatility autocorrelation decay, cf. Corollary [oj 

Roughly speaking, the idea is to compute empirically these quantities on the DJIA time 
series and then to compare the results with the theoretical predictions of our model. This is 



justified by the ergodic properties of the increments of our process {Xt)t>Q, cf. equation (2.7). 

The first problem that one faces is the estimation of the parameters of our model: the two 
scalars A E (0, oo), D E (0, ^] and the distribution v of a. This in principle belongs to an 
infinite dimensional space, but in a first time we focus on the moments E[a) and E^a"^). In 
order to estimate {D, X, E{a), E^a"^)), we take into account four significant quantities that 
depend only on these parameters: 



the multiscaling coefficients Ci and C2 (see (2.13)); 



the multiscaling exponent A{q) (see (2.14)); 



the volatility autocorrelation function p{t) (see ( |2.17 )) 



We consider a natural loss functional C = C{D, A, E{a), E{a'^)) which measures the distance 
between these theoretical quantities and the corresponding empirical ones, evaluated on the 
DJIA time series, see (7.3) below. We then define the estimator for (D, X, E(a), E{a'^)) as 
the point at which C attains its overall minimum, subject to the constraint E{a'^) > (£'(cr))^. 

It turns out that the estimated values are such that E{a^) ~ (-E(ct))2, that IS a IS 
nearly constant and the estimated parameters completely specify the model. (The constraint 
E{a'^) > {E(a))'^ is not playing a relevant role: the unconstrained minimum nearly coincides 
with the constrained one.) Thus, the problem of determining the distribution of a beyond 
its moments E{a) and E(a^) does not appear in the case of the DJIA. More generally, even 

we had found Var{a) := E{a'^) — {E{a))'^ > and hence a is not constant, fine details of its 
distribution beyond the first two moments give a negligible contribution to the properties 



that are relevant for application to real data series, as we observed in Remark 10 

7.2. Estimation of the parameters D, X, E{a), E{a'^). Let us fix some notation: the 
DJIA time series will be denoted by {si)o<i<N (where = 18848) and the corresponding 
detrended log-DJIA time series will be denoted by {xi)o<i<N- 

Xi := log(si) - d{i) , 

where d{i) := 250 Sfc=i-250 ^os('^«) mean log-DJIA price on the previous 250 days. 

(Other reasonable choices for d{i) affect the analysis only in a minor way.) 



The theoretical scaling exponent A(q) is defined in (2.14) while the multiscaling constants 
Ci and C2 are given by (2.13) for g = 1 and q = 2. Since q* = — D)~^ > 2 (we recall that 



Q < D < i), we can write more explicitly 



2^Ti\+D)Eia)X^l'^~° . s . 9^ 1 9n 

Ci = — i^-^ , C2 = 2DT{2D)E{a^)X^-^^ . (7.1) 
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Defining the corresponding empirical quantities requires some care, because the DJIA data 
are in discrete-time and therefore no /i 4- limit is possible. We first evaluate the empirical 
g- moment fhq{h) of the DJIA log- returns over h days, namely 

^ N-h 

By Theorem [6| the relation logrhq{h) ~ A{q){logh) + log(Cg) should hold for h small. 
By plotting (logmg(/i)) versus (log/i) one finds indeed an approximate linear behavior, for 
moderate values of h and when q is not too large {q < 5). By a standard linear regression 
of (log frig (/i)) versus (log/i) for h = 1,2,3,4,5 days we therefore determine the empirical 
values of A{q) and Cq on the DJIA time series, that we call A(q) and Cq. 

For what concerns the theoretical volatility autocorrelation, Corollary [9] and the station- 
arity of the increments of our process {Xt)t>o yield 

p{t) := limp(|X,|,|Xi+,-Xi|) = — — |— ^™^e-^*0(At), (7.2) 
HO TT Var[a \Wi\ ^/^) 

where S ~ Exp{l) is independent of a and Wi and where the function is given by 
^{x) = Var{a) EiS^'-'/HS + xf + E{afCov{S''-'/MS + x)''-'/^), 



cf. ( |2.18 ). Note that, although (f>{-) does not admit an explicit expression, it can be easily 



evaluated numerically. For the analogous empirical quantity, we define the empirical DJIA 
volatility autocorrelation ph{t) over h-days as the sample correlation coefficient of the two 
sequences {\xi+h - Xi\)Q<i<N-h-t and {\xi+h+t - Xi+t\)o<i<N-h-t- Since no /i | limit can 
be taken on discrete data, we are going to compare p(t) with phit) for /i = 1 day. 
We can then define a loss functional C as follows: 



(7.3) 



1 



where the constant T controls a discount factor in long-range correlations. Of course, different 
weights for the four terms appearing in the functional could be assigned. We fix T = 40 (days) 

and we define the estimator (D, A, E{a), E{a^)) of the parameters of our model as the point 
where the functional C attains its overall minimum, that is 

{DXE{a),E{^)) := argmin {C{D, X, E{a), E{a^))} , 

De(0,|], A,£;(a),B(a2)e(0,oo) 
such that _E(o-2)>{_B(cr))2 

where the constraint E{a^) > {E{a)f is due to Var{a) = E{a^) - {E{a)f > 0. We expect 
that such an estimator has good properties, such as asymptotic consistency and normality 
(we omit a proof of these properties, as it goes beyond the spirit of this paper). 

We have then proceeded to the numerical study of the functional C, which appears to 
be quite regular. With the help of the software Mathematica [26j, we have obtained the 
estimates for the parameters, given already in ( 2.20[ ): 



3-0.16, A - 0.00097, E(?)^0A08, E{^) ^ 0M17 ^ (E(7)Y (7.4) 
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7.3. Graphical comparison. Having found that E{a'^) ~ {E{a))'^ , the estimated variance 
of a is equal to zero, that is a is a constant. In particular, the model is completely specified 
and we can compare some quantities, as predicted by our model, with the corresponding 
numerical ones evaluated on the DJIA time series. The graphical results have been already 
described in section [2]4] and show a very good agreement, cf. Figure [3] for the multiscaling 
of moments and the volatility autocorrelation and Figure |4] for the log-return distribution. 

Let us give some details about Figure[4| The theoretical distribution := P{Xt G •) = 
P{Xt — Xq G •) of our model, for which we do not have an analytic expression, can be easily 
evaluated numerically via Monte Carlo simulations. The analogous quantity evaluated for 
the DJIA time series is the empirical distribution pt{-) of the sequence {xi^t — Xi)o<_i<N-t'- 

^ N-t 



N + l~t 



In Figure ^[(a) we have plotted the bulk of the distributions pt{ ) and pt{-) for t = 1 (daily 
log-returns) or, more precisely, the corresponding densities, in the range [— 3s,+3s], where 
s ~ 0.0095 is the standard deviation of pi(-) (i.e., the empirical standard deviation of the 
daily log returns evaluated on the DJIA time series). In Figure ^KB)| we have plotted the tail 
of Pi{-), that is the function z i— >• P{Xi > z) = P{Xi < —z) (note that Xt ~ —Xt for our 
model) and the right and left empirical tails R{z) and L{z) of pi(-), defined for z > by 

?( \ _ #{1 < ^ < ^ : Xi-i < -z} g _ #{1 <i<N: Xi - Xi^i > z} 
Hz)-- ^ , K[z):- - , 

in the range z E [s, 12s]. 

7.4. Variability of estimators. In this paper we have identified relatively rare but dra- 
matic shocks in the volatility as the main common source of various stylized facts such 
as multiscaling, autocorrelations and heavy tails. As observed in Remark |10[ the expected 
number of shocks in a period of 75 years is about 18, which is a rather low number; this 
means that empirical averages may be not very close to their ergodic limit or, in different 
words, estimators should have non-negligible variance. A way to detect this is to simulate 
data from our model for 75 years, and then compute estimators using data in different sub- 
periods, that we have chosen of 30 years. Figure l \^A) and I f^B) show indeed a considerable 
variability of the values of the estimators for the multiscaling exponent and the volatility 
autocorrelations, when computed in different subperiods. We have then repeated the same 
computations on the DJIA time series, see Figure ^[(c) and £|Kd) and we have observed a 
similar variability. We regard this as a significant test for this model. 

Remark 13. We point out that, among the different quantities that we have considered, 
the scaling exponent A{q) appears to be the most sensitive. For instance, if instead of the 
opening prices one took the closing prices of the DJIA time series (over the same time period 
1935-2009), one would obtain a different (though qualitatively similar) graph of A{q). 

Remark 14. The multiscaling of empirical moments has been observed in several financial 
indexes in |17) . where it is claimed that data provide solid arguments against model with 
linear or piecewise linear scaling exponents. Note that the theoretical scaling exponent A{q) 
of our model is indeed piecewise linear, cf. (2.14). However, Figure ^[(a) shows that the 
empirical scaling exponent A(q) evaluated on data simulated from our model "smooths out" 
the change of slope, yielding graphs that are analogous to those obtained for the DJIA time 
series, cf. Figure ^[(c) This shows that the objection against models with piecewise linear 
A{q), raised in ^17j, cannot apply to the model we have proposed. 
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(a) Multiscaling exponent m 
subperiods of 30 years: simu- 
lated data. 




(b) Volatility autocorrelation 
in subperiods of 30 years: 
sumilated data. 




(c) Multiscaling exponent m 
subperiods of 30 years: DJIA 
data. 
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(d) Volatility autocorrelation 
in subperiods of 30 years: 
DJIA data. 



Figure 5 . Variability of estimators in subperiods of 30 years. 

Empirical evaluation of the observables A{q) and pi{t) in subperiods or 30 years 
for a 75-years-long time series, sampled from our model {Xt)t>o ( (a) and (b) ) and 
from the DJIA time series ( (c) and (d) I 



Appendix A. Proof of Proposition [TT] 
We first need two simple technical lemmas. 

Lemma 15. For {) < q < 2, consider the function (pq : [0, +oo) — )• [0, +oo) define by 

»+oo 
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Then there are constants Ci, (72 > 0, that depend on q, such that for all (3 > 



(A.l) 



Proof. We begin by observing that it is enough to estabhsh the bounds in (A.l) for j3 large 

enough. Consider the function of positive real variable /(r) := e^^''~2''^. It is easily checked 

1 

that / is increasing for < r < {(iq) 2-9 . Thus 



{/3g)2^ 



1 



, /(r-)dr > -(/3g)^/ 



1 



(/3g)2-9 exp c(g)/32-J 



with c{q) := ^^^-g — ^q'^~i > 0. The lower bound in (A.l) easily follows for /3 large. 

1 2 

For the upper bound, by direct computation one observes that /(r) < e~^^ for r > 

(4/3) . We have: 



MP) < 



1 fi\x\)dx + 

|x|<{4/3)2=? 



1 e-3^'dx < 2(4/3)5^ 

|x|>(4/3)2=? 



+ 



e 4 dx . 



Since ||/||oo = ) = exp C{q)P^-i for a suitable C{q), also the upper bound 

follows, for (3 large. □ 

Lemma 16. Let Xi,X2, ■ ■ ■ ,Xn be independent random variables uniformly distributed in 
[0, 1], and Ui < U2 < ■ ■ ■ < Un be the associated order statistics. For n > 2 and k = 2, . . . ,n, 
set := Uk — Uk-i-Then, for every e > 



lim P 

n— >+oo 



k£{2,...,n} : ^fc > 



n 



l+e 



> n 



l-e 



1. 



Proof. This is a consequence of the following stronger result: for every x > 0, as n — )• 00 we 
have the convergence in probability 

^ <;/c G i2,...,n[ : > 

n . 



n 



/c G {2,...,n} : Cfe > -| 



see [33j for a proof. 



□ 



Proof of Proposition 11. Since Xt = Wj^ and \/ItWi have the same law, we can write 



E 



=71^*1" 



E 



exp{-flf^\Wi\'^ 



We begin with the proof of (6.3), hence we work in the regime q < (1 — D)~^, or q = 
(1 — D)~^ and < ^; in any case, q < 2. We start with the "<^" implication. Since It and 
Wi are independent, it follows by Lemma 15 that 



E 



exp(^-flf^\Wi\'^ 



< CE 



exp ( 61^ 



r2-q 



(A.2) 



bound (|4.10|) that 



for some C,6 > 0. For the moment we work on the event {i{t) > 1}. It follows by the basic 

(A.3) 



i{t) 



k=0 
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where we set 



Ti for A; = 

Tfc+i -Tk for 1 < /c < i{t) - 1 



t — T, 



i{t) 



for k = i{t) 



Note that Y^^^^o^k = t- By applying Holder inequality to (A.3) with exponents P = 215' 



P 



1 



1-2D' 



we obtain: 



< t 



2D 



'i{t) 
, fc=0 



1-2D 



2 

-2D 



By assumption q < yztq, which is the same as (1 — 2D)i^^ < 1. Thus 



_2_ 2Dq 

1^-1 < 



,fc=0 



(1-2D) 



_^ 

-215 



2-5 



< 



2Dq „ 



i{t) 



a 



_2q_ 
2-q 



(A.4) 



fc=0 



Now observe that if i{t) = we have It = al[(t - tq^^ - (-tq)^^] < alt^^ , hence 
holds also when i{t) = 0. Therefore, by (A.2) 



< CE 



exp 



i(t) 

2^^-- .2^ 



fc=0 



CE 



(A.5) 



where we have set 



p = pt:= E 



2Dq 



2q 



exp ( (5^2-9 fTg-" 



Therefore, if /9 < oo, the right hand side of (A.5) is finite, because i{t) ~ Po{Xt) has finite 
exponential moments of all order. This proves the "<^" implication in (6.3). 

The "=^" implication in (6.3) is simpler. By the lower bound in Lemma 15] we have 



E 



=71^*1" 



E 



> CE 



exp ( 6 If 



r 2 — q 



(A.6) 



for suitable C,6 > 0. We note that 



E 



exp 51^ 



1 

r2-q 



> E 
= E 



1 

r2-q 



exp {^61^ " j l{i(t)=o} 
exp (5 [{t-To?^ -{-to) 



q _22_ 
2Dl2^ ^2-q 



(A.7) 



pm = 0) 



Under the condition 



E 



_2q_ 

exp ( ao"2-9 



+00 Va > 0, 



the last expectation in (A.7) is infinite, since [{t - Tof^ - {-Tof^] > almost surely and 
is independent of ao- Looking back at (A.6), we have proved the implication in (6.3). 

Next we prove (6.2), hence we assume that q > (1 — D)~^. Consider first the case q < 2 
(which may happen only for D < By (A.6) 



E 



> CE 



exp ( 6 If 



r2-q 
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We note that, by the definition (2.1) of It, we can write 



i(t) 



It > '^Crl_^{Tk - Tk-l] 



2D 



(Ai 



k=2 



where we agree that the sum is zero if i{t) < 2. For n > 0, we let Pn to denote the conditional 
probability P{ ■ \i{t) = n) and En the corresponding expectation. Note that, under P„, the 
random variables (rfc — Tfc_i)^^2 have the same law of the random variables {S,k)'k=2 ™ 
Lemma [TgJ for n > 2. Consider the following events: 

:= {o-fe > o, VA; = 2, . . . ,n} , " '''' 



k 



,n: ^k> 



n 



> n 



where a > is such that u[[a, +oo)) =: p > and e > will be chosen later. Note that 
PniAn) = /o"~^ while Pn{Bn) — )• 1 as n — )• +00, by Lemma 16, In particular, there is c > 
such that Pn{Bn) > c for every n. Plainly, An and Bn are independent under P„. We have 



ipin) := En 



exp ( 51^ 



r2-q 



> En 



>cp"- exp 



exp ( 51^' 1 1 



n 



2D^ 

2-q 



n — 1 

c p exp 



r ^ (l-2Z)-e(l+2D))^ 



(A.9) 



Note that q > is equivalent to (1 — 2D)i^^ > 1, therefore e can be chosen small enough 
so that h ■.= {!- 2D- e(l + 2D))^ > 1. It then follows by (|X9| that ^p{n) > d exp(dn'') 
for every n G N, for a suitable d > 0. Therefore 



E 



exp ( (5/j 



r2-(j 



= +oo. 



because ~ Po{Xt) and hence £'[exp((ii(t)'')] = oo for all d > and b > 1. 
Next we consider the case q > 2. Note that 



E 



=71^*1" 



E 



exp (7/^^/^ I TV: 



(A.IO) 



hence if g > 2 we have E [e'^l"'^*!'] = 00, because £^[exp(c| VFi|'^)] = 00 for every c > 0, /t > 
almost surely and It is independent of Wi. On the other hand, if g = 2 we must have D < ^ 
(recall that we are in the regime q > {1 — D)^^) and the steps leading to (A.9) have shown 
that in this case It is unbounded. It then follows again from dAlol) that E[eTl-^'l'] = 00. □ 



Appendix B. The model of Baldovin and Stella 

Let us briefly discuss the model proposed by F. Baldovin and A. Stella OEI], motivated 
by renormalization group arguments from statistical physics. They first introduce a process 
{Yt)t>o which satisfies the scaling relation ( |1.3[ ) for a given function g, that is assumed to 
be even, so that its Fourier transform g(u) := Jj^e^^^g{x)dx is real (and even). The process 
{Yt)t>o is defined by specifying its finite dimensional laws: for h < t2 < • • • < tn the joint 
density of Yt^,Yt^,..., Yt^ is given by 

( -^ + + ^ ^1 ^2 - XI Xn-Xn-l\ ^ ^ 

p{xi,ti;X2,t2; . . .■,Xn,tn) = h [ — =, , . . . , ,, , , (B.l) 

VV'-I V '-2 ~ f^l yi^n — t^n-lj 
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where h is the function whose Fourier transform h is given by 



h{ui,U2, ■ ■■,Un) 



(B.2) 



Note that if g is the standard Gaussian density, then {Yt)t>o is the ordinary Brownian motion. 



For a non Gaussian g, the expression in (B.2) is not necessarily the Fourier transform of a 
probability on M", so that some care is needed (we come back to this point in a moment). 
However, it is clear from (B.l) that the increments of the process {Yt)t>o corresponding to 
time intervals of the same length (that is, for fixed tj+i — ti) have a permutation invariant 
distribution and therefore cannot exhibit any decay of correlations. 

For this reason, Baldovin and Stella introduce what is probably the most interesting 
ingredient of their construction, namely a special form of time-inhomogeneity. They define it 
in terms of finite dimensional distributions, bur it is simpler to give a pathwise construction: 
given a sequence of (possibly random) times < ri < r2 < • • • < r„ f +00 and a fixed 
< Z? < 1/2, they introduce a new process {Xt)t>o defined by 



Xt:=Yt2D fortE[0,Ti) 



and more generally 



Yu 



for t G [Tn,Tn 



+1) 



(B.3) 



(B.4) 



For D = 1/2 we have clearly Xt = Yt, while for D < 1/2 the process {Xt)t>o is obtained from 
iYt)t>o by a nonlinear time-change, that is "refreshed" at each time r^. This transformation 
has the effect of amplifying the increments of the process for t immediately after the times 
(Tn)n>i, while the increments tend to become small for larger t. 



Let us shed some light into the implicit relations (B.1)-(B.2). If a stochastic process 
{Yt)t>o is to satisfy these relations, it must necessarily have exchangeable increments: by 
this we mean (cf. [LQ\ p. 1210]) that, setting AY(^a,b) ■= Yb — Ya for short, the distribution 
of the random vector (AY/j+^j, . . . , AY/^^+y^) — where the I/s are intervals and y^-'s real 
numbers — does not depend on yi , . . . , y„ , as long as the intervals yi + Ii , . . . , Un + In 
are disjoint. If we make the (very mild) assumption that iYt)t>o has no fixed point of 
discontinuity, then a continuous-time version of the celebrated de Finetti's theorem ensures 
that {Yt)t>o is a mixture of Levy processes, cf. Theorem 3 in [19J (cf. also [1]). Actually, 
more can be said: since by (1.3) the distribution of the increments of {Yt)t>o is isotropic, 
i.e., it has spherical symmetry in M", by Theorem 4 in |19j the process iYt)t>Q is necessarily 
a mixture of Brownian motions. This means that we have the following representation: 



Yt = <jWt., 



(B.5) 



where iyVt)t>o is a standard Brownian motion and cr is an independent real random variable 
(a random, but time-independent, volatility). Viceversa, if a process {Yt)t>Q satisfies (|B.5|), 



then, denoting by v the law of cr, it is easy to check that relations (B.1)-(B.2) hold with 

1 



g{x) 



V2 



.2 

2(t2 



vrcr 



u{da) 



(B.6) 



or, equivalently, 



2 2 

y u 
2 



This shows that the functions g for which (B.1)-(B.2) provide a consistent family of finite 



dimensional distributions are exactly those that may be expressed as in (B.6) for some 
probability u on (0,+oo). 
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Note that a path of (B.5) is obtained by samphng independently a from v and {Wt)t>o 
from the Wiener measure, hence this path cannot be distinguished from the path of a Brown- 
ian motion with constant volatihty. In particular, the (possible) correlation of the increments 
of the process (lt)t>o cannot be detected empirically, and the same observation applies to 
the time-inhomogeneous process {Xt)t>Q obtained by {Yt)t>Q through (B.3)-(B.4). In other 
words, the processes obtained through this construction have non ergodic increments. 

Nevertheless, Baldovin and Stella claim to measure nonzero correlations from their sam- 
ples: after estimating the function g and the parameters to and D on the DJIA time series, 
their simulated trajectories show a good agreement with the clustering of volatility, as well 
as with the basic scaling (1.3) and the multiscaling of moments. The explanation of this ap- 
parent contradiction is that Baldovin and Stella do not simulate the process {Xt)t>Q defined 
through the above construction, but rather an autoregressive approximation of it. In fact, 
besides making a periodic choice of the times := ntg, they fix a small time step 5 and a 
natural number N and they first simulate X5, X25i . • • , x^s according to the true distribu- 
tion of (X^, X25, . . . , Xns)- Then they compute the conditional distribution of X(jv+i)5 given 
X25 = X25, X3S = ■ ■ ■ , Xjys = xns — thus neglecting xs — and sample a;(7v+i)5 from 
this distribution. Similarly, X(jv+2)(5 is sampled from the conditional distribution of ^(Ar+2)(5 
given Xss = x-^s, ... , X^s = xns, ^(Ar+i)5 = 2;(iv+i)5, neglecting both xg and X25, and so 
on. It is plausible that such an autoregressive procedure may produce an ergodic process. 
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